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ABSTRACT 

By  generalizing  a  trivial  problem  in  estimation  theory,  some 
interesting  results  have  been  obtained  for  a  class  of  pulse  amplitude 
modulation  (PAM)  communication  channels.  In  particular,  we  consider 
a  linear  channel  with  memory  perturbed  by  additive  noise  which  is  uncorrelated 
with  the  signal  statistics.  It  is  shown  that,  for  specified  second-order 
statistics  and  a  mean  square  error  criterion,  the  Bayes  estimation  rule  for 
the  specific  case  of  zero-mean  Gaussian  signal  and  noise  statistics  is  also 
both  the  best  linear  estimation  rule  and  the  minimax  estimation  rule.  This 
fact  is  used  in  the  theoretical  design  of  sampled-data  and  continuous  time 
receivers  for  such  PAM  systems.  In  the  continuous  time  case  with  an 
infinite  observation  interval  and  stationary  noise  statistics,  the  receiver 
thus  designed  agrees  with  the  optimum  linear  receiver  previously  derived 
for  this  case  by  Tufts  [2]  using  a  very  different  approach;  the  present 
analysis  shows  that  Tufts's  linear  receiver  is,  therefore,  also  the  best 
nonlinear  receiver  in  the  minimax  sense. 
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In  this  report  we  consider  the  consequences  of  the  following  trivial 
estimation  problem. 

Problem  1: 

The  real  random  variable  a  has  the  Gaussian  probability  density 

2 


Pi (a) = 


1 


a 

2b' 


;  -  oo  <  a  <  oo  . 


(1) 


b 

The  real  random  variable  y  is  statistically  independent  of  a  and  has  the 
Gaussian  probability  density 

1 


p2(y) = 


e  ,  -  oo  <  y  <  oo 


(2) 


<r 

The  sum  random  variable  x  =  a  +  y  is  observed.  It  is  desired  to  find  that 
estimation  rule  a(x)  which  minimizes  the  mean  square  error  D  between 
a(x)  and  a  .  This  mean  square  error  is  defined  by: 

oo  oo  00  oo 

D  =  (a(x)-  o)^ =  J  ^  [a(o+y)-a]^Pl(o)p2{y)dody  =  J  J  [a(x)-tt]^Pj{tt)p2(x-a) 

(3) 


-oo  -00 


-00  -00 


Solution: 


It  is  well  known  ([1],  p.  190)  that  the  optimum  estimate  (x)  is  the 


mean  of  a  ,  conditional  \xpon  having  observed  x  ,  namely 

CO 


oo 

1  (x)  =  Jdop(o|x) 


ap2(x-a)pi  (o)do 


-OO 


-00 


oo 

jpj(x.a)p, 


(4) 


(a)da 


•oo 
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Substituting  (1)  and  (Z)  into  (4)  and  performing  the  indicated  computations 
(See  Appendix  I)  yields  the  result: 


(5) 


The  resultant  minimum  value  of  D  found  upon  substitution  of  (x)  from 
(5)  into  (3)  is  (See  Appendix  I): 


We  now  introduce  some  terminology  and  notation  which  will  be 
employed  below  in  discussing  other  estimation  problems  related  to  Problem  1. 

The  expected  value  of  a  function  f  (a,  y)  of  two  real  random  variables 
with  joint  density  p(a,  y)  will  be  denoted  by 


oo  oo 

Ep[f(a,y)]  =  J  J  f(a,y)p(a,y)dady. 
-oo  -oo 


(7) 


We  shall  denote  by  W  the  class  of  all  joint  probability  densities 
p  (  a ,  y )  which  satisfy  the  following  requirements 


E  (a^)  =  b^ 

P 

(8a) 

Ep ( ay )  =  0 

(8b) 

Ep(y^)  =  . 

(8c) 
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Problem  2: 

It  is  known  that  the  joint  density  of  a  and  y  is  an  element  of  W. 

The  sum  random  variable  x  =  a  +  y  is  observed,  and  it  is  required  that  the 
estimation  rule  a(x)  be  linear,  i.  e.  ,  the  only  permissible  estimation  rules 
are  of  the  form  a(x)  =  kx  .  We  seek  that  estimation  rule  a^Cx)  of  this 
form  which  minimizes  the  mean  square  error  D  defined  by 

D(a,  p)  =  Ep  [[a(x)-a]^]  .  (9) 

Solution; 

First  note  that  if  a(x)  is  linear,  then  D(a,  p)  is  independent  of 
which  peW  is  used  to  compute  it,  because 

D(kx,p)  =  Ep  |[k(a+y)-a]^|  =  Ep[(k- l)^a^]  +  [2k(k-l)ay]  +  E^ [k^y^]  , 

from  which  by  use  of  (8)  we  obtain 

D(kx,p)  =  (k-l)^  b^+ k^<r^  =  £(k)  .  (10) 

Setting  f  (k)  =  0  yields  k  =  ,  while  f"  (k)  =  2«r^  >  0  insures  a 

minimvim.  Accordingly,  a2(x)  =  a^(x)  . 

Alternatively,  one  could  argue  as  follows.  Since  the  value  of 
p)  is  independent  of  peW  when  a  is  linear,  we  can  compute  it 
using  any  p  c  W  whatsoever.  In  particular,  suppose  we  were  to  use 

*p(oi,  y )  =  Pj  (“ )  P2(y )  (11) 

where  p^(a)  and  p^iy)  are  given  by  Eqs.  (l)and(2),  respectively; 
one  easily  verifies  that  p  (a,  y)  eW.  Now  Problem  1  tells  us  that  for  any 
a(x)  and,  hence,  a  fortiori,  for  any  linear  a(x)  ,  we  have 
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D(a,  p)  >  ,  p)  =  .  (12) 

Since  is  itself  linear,  we  may  conclude  that  ( x )  =  (x)  . 

The  alternative  method  of  solution  of  Problem  2  proves  useful  in 
solving  the  following  minimax  estimation  problem. 

Problem  3: 

It  is  desired  to  find  the  estimation  rule  a  (x )  which  minimizes  the 
maximum  mean  square  error  that  occurs  when  p(a,  y)  is  varied  over  W, 
i.  e.  ,  our  optimum  estimate  (  x  )  is  defined  by  the  requirement 

(''“<*>)  (p7w  pTw  °<“3-  ”0  ■ 


Solution: 


The  following  argument  establishes  that  a^(x)  =  (x)  .  Since 

(x)  is  linear,  we  know  that  D(a^  ,  p)  =  const.  =  for  all  pcW  and, 
therefore, 

D(S,,p)  =  Dj.  (14) 

On  the  other  hand,  we  know  from  Problem  1  that  Qj  (x)  yields  the  least 
value  of  D  when  p  =  p^ .  Accordingly,  for  all  a  ( x )  we  have 

D(i, p)> D(a,?r)> DiSj,  p)  =  Dj  =  (^^,p) .  (15) 

Comparing  (15)  with  (13)  shows  that  a^ix)  =  o^  (x)  . 

In  summary  then,  we  see  that  in  the  space  W  of  joint  probability 
densities  as  defined  by  (8),  the  least  favorable  distribution  with  respect 
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to  a  mean  square  error  criterion  is  p(a,  y  )  of  equation  (11)  in  which  a 

and  y  are  statistically  independent,  zero  mean  and  Gaussian.  Since  (x), 

the  optimum  estimation  rule  for  p(a,  y )  and  a  mean  square  error  criterion, 

is  linear  in  x  ,  we  have  seen  that  it  is  in  fact  the  solution  to  all  three  of  the 

in 

estimation  problems  we  have  considered. 

We  now  employ  the  knowledge  gained  from  the  above  problems  in  order 
to  solve  some  estimation  problems  regarding  the  pulse  amplitude  modulation 
system  depicted  in  Fig.  1. 


Figure  1:  Block  Diagram  of  PAM  Communication  System 


We  could  have  defined  W  somewhat  more  broadly  and  still  had  Oi  (x ) 
as  the  minimax  solution.  In  particular,  we  could  have  replaced  (8a;  by 
2  2  2 

£  (a  )  <  b  and  (8c)  by  £  (y  )  <  a  ,  and  we  could  have  permitted 


probability  distributions  P(a,  y)  which,  because  of  discrete  concentrations 
of  probability,  possess  no  density  p(a,  y). 
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N 

8  ( t  -  Tj^ )  from  the  random  vector  a  , 
k=l 

the  known  vector  r  and  the  known  function  s  (t).  This  transmitter  output 
is  then  sent  over  a  linear,  dispersive  channel  characterized  by  an  impulse 
response  h(t).  The  receiver  input  x(t)  is  an  additive  combination  of  the 
channel  response  u(t)  to  the  transmitter  output  and  a  sample  function  y  (t) 
of  a  random  process.  The  receiver  operates  on  the  noisy  waveform  x(t) 

A 

to  produce  the  vector  a  of  estimates  of  the  respective  components  of  a  . 

The  design  of  the  receiver  may,  therefore,  be  viewed  as  a  problem  in  statistical 
estimation  theory.  We  start  with  a  relatively  trivial  problem  analogous  to  Problem  1. 


The  transmitter  forms  the  sum 


I 


Problem  4; 

Suppose  that  the  message  vector  a  has  a  zero-mean  Gaussian 
probability  density  of  the  form  ^  ^ 


1  T^-1 

T-  5  i 


(16) 


In  Eq.  (16),  ^  represents  the  i-j”**  element  of  the  inverse  of  the  auto¬ 

correlation  matrix  £  .  Further  suppose  that  y(t)  is  a  sample  function  of 
a  zero  mean,  possibly  nonstationary  Gaussian  random  process  statistically 
independent  of  a  and  having  the  autocorrelation  function  7(t,  s  )  =  y(t)y(s). 
The  receiver  input  x(t)  is  observed  at  M  distinct  time  instants 

>  t^ ,  .  . . ,  tj^  ;  the  observed  random  variables,  therefore,  constitute 


.th 
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an  M-dimensional  colvunn  vector  x  with  components  x _  =  x(t _ ): 

—  mm' 

m  =  1,  2,  .  .  . ,  M.  We  seek  that  estimation  rule  a(x)  which  minimizes  the 
mean  square  error  between  a  and  a  defined  as 
_  N 

D  =  [o(x)-a]^=^  ^  ^  dadx  (x  )  -  oj^p  (o  )q(x  I  o  )  .  (17) 

i=l  * 

The  integration  in  Eq.  (17)  extends  over  the  entire  N  +  M  dimensional  space 
of  which  dadx  =  dXj  .  .  .  da^dx^  .  .  .  is  a  differential  element  of 

volume,  and  q(x  |  a)  is  the  M-dimensional  probability  density  of  x 
for  a  given  message  vector  a  . 


Solution: 

The  signal  component  u(t)  of  the  receiver  input  waveform  x(t) 
is  given  by 


N 


00 


u(t)  ajr(t-Tj)  ,  where  r(t)=j*s(t-f)h(|)d|.  (18) 

j=l 


-oo 


In  order  to  write  down  q(x  |  a)  explicitly,  we  introduce  the  following  notation, 

Let  r  be  the  M  x  M  autocorrelation  matrix  whose  m-n^^  element  is 

Y(t  ,  t  )  =  Y  ,  and  let  y  be  the  m-n^^  element  of  F  .  Let 

* '  m  ’  n  'mn  ’  'mn  - 

N 

u(o)  be  the  column  vector  with  components  u(t  )  =  /  a.r(t  -Tj)  = 

—  —  m  /  y  1  m  l 


N 


j=l 


>c  .a.=  u  (a):m=l,2,  We  then  may  write 

mj  j  m  - '  *  r  t  t  J 

j=l 
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M  M 


q(x|o) 


iriVJ - 


(Zv) 


m=ln=l 

Wr 


IjF^ 


(19) 

It  is  a  standard  result  of  estimation  theory  ([l],pp.  188-190)  that  the 
optimum  estimation  rule  is  the  mean  of  ci  conditional  \q>on  having  observed 
X  ,  the  i^^  component  of  which  is 


ai(x) 


)  q(x|a)  da 


Jq(x|o)  p(o)  da 


(20) 


The  integrations  indicated  in  Eq.  (20)  are  performed  in  Appendix  II  and 
yield  the  result 


“i  (*) 


M 

Z 

m=l 


9.  X 
im  m 


i  =  1,  2,  . . N 


(21) 


The  constants  6^  appearing  in  (21)  are  defined  in  Appendix  II. 

The  most  important  property  of  the  estimation  rule  specified  by  (21) 

is  that,  like  the  optimum  estimate  (x)  of  Problems  1,  2,  and  3,  it  is 

linear  in  the  observed  random  variables  x 

m 

We  now  introduce  some  terminology  and  notation  which  will  be 
employed  below  in  discussing  other  estimation  problems  related  to  Problem  4. 
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The  expected  value  of  a  function  f(a,  of  two  real  random  vectors 
With  joint  probability  density  p(a,  y.)  will  be  denoted  by 


Ep[f(.,ir)l  =  J  J 

f z.)  z.)  ^z  • 


(22) 


We  shall  denote  by  V  the  class  of  (M+N)  dimensional  joint 
probability  densities  p(a,  which  satisfy  the  following  requirements 


E  ( a.  a.  )  =  A., 

p '  j  k'  ^jk 

E  ( a. y  )  =  0 
p  J  ^m' 


E  ( y  y  )  =  y 
p  ^m'^n  'mn  j 


(22a) 

y  j,k=  1,2,  m,n=  1,  2,  ...,  M.  (22b) 

(22c) 


Problem  5: 

Let  a  be  the  N-component  random  message  vector  and  let  )r  be 

M-component  random  vector  whose  elements  are  the  noise  samples 

y(t^)  =  y^.  It  is  known  that  p(a,  y)€  V.  The  M-component  random  vector 

X  whose  elements  are  the  receiver  input  samples  x(t  )  =  x  is  observed. 
— ,  mm’ 

and  it  is  required  that  the  estimation  rule  ^(x)be  linear,  i.  e. ,  the  only 
permissible  estimation  rules  are  of  the  form  a(x)  =  Kx  ,  where  K  is  an 
NxM  rectangular  matrix.  We  seek  that  estimation  rule  of  this  form  which 
minimises  the  mean  square  error  D  defined  by 

N 

D(a,  p)  =  Ep{l«(5)-il^]  =  ^ 

U1 


(23) 
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Solution: 


For  linear  a(x)  we  find  that  D  ( a  ,  p  )  is  independent  of  which 


p  e  V  is  used  to  compute  it,  since  we  have 


N  r-M 

d(Kx.p)=^eJ  J  k, 


X  -o. 
im  m  1 


i=l 

N 


Lm=l 


N 


rM 


i=l 


/  k.  [y  +u  (a)]-a. 
ljn=  1  _ 


D(Kx,  P) 

i=l 

N 

D  (K^  p  )  =  ^  E, 


M 

1  N  . 

zl 

y 

y  +  /  c  ;Cl.  1 

-  a. 

\ 

im 

/m  mj  J  1 

1 

( 

m=  1 

^  j=l  / 

J 

i=l 


k.  y  + 
im  'm 


N  /  M 

ll 


A 


k.  c 
xm  mj 


a.  -  a. 
J  1 


(24) 


j=l  \m=l 

When  the  sqxiaring  operation  indicated  in  (24)  is  carried  out  and  the 
expectation  operation  is  commuted  with  the  sxxmmations,  it  becomes  clear 
from  (22)  that  D(l^,  p)  is  independent  of  peV  .  It  then  follows  by 
reasoning  directly  analogous  to  the  alternative  method  of  solution  of  Problem 
2  that  the  desired  estimation  rule  is  that  already  specified  by  Eq.  (21)  as  the 
solution  to  Problem  4,  i.  e.  ,  the  optimum  estimation  rxxle  has  K  =0  =  i 

the  distribution  analogovxs  to  p(oi,y)  of  Eq.  (11)  is 

1  r  T_-l  .  Tt,-1  1 

■“2‘l.i*  I-l 

T5RTJ - i  — 


piS.,1)  =  - 


(2») 


i*r  III 


1 

2 


(25) 
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Problem  6; 

It  is  desired  to  find  that  estimation  rule  ^(x)  which  minimizes  the 
maximum  assumed  by  D(a,  p)  as  p  is  varied  over  V. 

Solution: 

In  complete  analogy  to  Problem  3,  we  see  that  Eq.  (21)  specifies  the 
desired  estimation  rule. 

In  summary,  the  common  solution  of  Problems  4,  5,  and  6  is  the 
optimum  sampled-data  mean  square  error  receiver  for  the  PAM  system  of 
Fig.  1,  where  the  term  "optimum"  is,  of  course,  to  be  construed  with 
regard  to  the  statistical  knowledge  assumed  and  the  performance  criteria 
adopted  in  these  problems. 

Of  particular  interest  is  the  limit  of  infinitely  large  M  which 
corresponds  to  the  case  of  continuous  observation  of  the  receiver  input. 
Although  it  is  possible  to  perform  the  limiting  operations  directly  upon  the 
form  of  the  as  derived  in  Appendix  II,  it  proves  somewhat  more 

convenient  to  revise  our  method  of  attack  slightly.  (For  background  material 
regarding  the  analysis  that  follows,  the  reader  is  referred  to  [1],  pp.  98-106.  ) 

Suppose  the  receiver  input  x(t)  is  observed  over  the  interval 
<  t  <  .  The  revision  in  our  method  consists  of  replacing  the  vector 

X  of  receiver  input  samples  with  a  vector  of  expansion  coefficients  of 
x(t)  in  terms  of  a  set  of  functions  ^f^  (t)|  orthonormal  over  (T^  ,  T^). 

In  particular,  we  choose  for  our  functions  f^  ( t )  the  orthonormal  solutions 
of  the  Karhunen-Loeve  integral  equation 
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Y(t, .)£„(. )d.  i  T,<t<Tj. 

Thu0,  the  elements  of  x  are 
■"2 

=  I  f^(t)x(t)dt. 


(26) 


(27) 


Under  the  assiimptions  of  Problem  4,  the  expansion  coefficients  for  a 

fixed  message  vector  a  wovild  be  statistically  independent  Gaussian  random 

variables  with  variances  \  and  means  /i  (a)  defined  by 

m  m  — 

"^2  N  /  "^2  \  ^ 

T;  M'  Tj  7  i=l 


Accordingly,  in  place  of  Eq.  (19)  we  obtain 

M  ,2 

1  Y 

■“Z’Z  ■ 

m=l 

q(2£|a)  =  - - 


m  m 


m 


(2jr\  )  2 

'  m  ' 


(29) 


Therefore,  the  computations  done  in  Appendix  II  remain  applicable  if  we 

- 1  ^mn 

replace  c  .  by  f  .  and  7  '  by  — ? -  .  In  the  limit  of  infinite  M, 

r  f  ^TYYi  'mn  ^  \  ’ 


mj 


'mj 


m 


the  vector  x  of  expansion  coefficients  provides  a  representation  of  the 
observed  receiver  input  voltage  x(t),  T^<t<  ,  which  is  sufficiently 

accurate  for  most  practical  purposes.  The  components  (x)  of  our 
optimum  estimate  approach  linear  functionals  of  x(t),  T^  <  t  <  T2  ,  which 
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will  be  denoted  by  [  x  ( t ) ,  Tj  <  t  <  j  i  =  1,  2,  .  .  .  ,  N.  We  now 

characterize  the  more  completely  by  examining  the  limiting  behavior 
o£Eq.  (21). 

Let  6^(t)  be  the  time  function  whose  m^^  Karhxmen-Loeve  expansion 

coefficient  is  9.  X.  ,  i.e.,  let 
im  m  ' 


CO  L 

0.(t)=y  9.  \  I  (t);  9.  \  =  \  f  (t)0.(t)dt. 

i'  Lj  immm''’  imm  Jm''i'' 


(30) 


m=l 


1 


Then,  let  ( t )  be  the  solution  of  the  integral  equation 

T, 


^i  ( t )  =  7  ( t ,  8  )  z.  (  8  )  ds  ;  Tj  <  t  < 


2  • 


(31) 


1 


.th 


It  follow8  that  the  i  component  of  our  optimum  eetimate  may  be  expreesed 
in  the  form 


T,  T,  T- 

OD  oo  2  oo  2  2 


m=l 


m=l  “  T 


m=l  T 


OO  2  -  .  _  - 


1  Tj 

00  „  ^2 


m=l  T 


m=l  T , 


2  00  I 

g,  .  J  d.  z,  (.)  ^  V'm'*)  'J 


m=l 


Zj(s)  x(s)  d8  . 


(32) 
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In  deriving  £q.  (32),  use  has  been  made  of  the  fact  that  7(t ,  s  )  is  a 
symmetric  kernel.  The  equation  indicates  that  may  be  formed  by  passing 
x(t)  through  a  linear,  time-invariant  filter  with  impulse  response  w.(t) 
given  by 


rz.(T-t)  ;  -T2  +  T<t<-Tj  + 


Wj  ( t )  =  •( 


(33) 


elsewhere 


The  filter  output  at  time  T  then  will  be 

-Tj+T 


00  i  z 

yx(T-T) Wj(T)dT  =  Jx(T-T)!e.(T-T)dT=  J  z.(8)x(s)d8  =  g.  .  (34) 


-oo 


-Tj+T 


It  appears  difficult  to  extend  the  general  analysis  beyond  this  point. 

In  the  specific  case  in  which  Tj  -►  -oo  and  T2  -*•  oo  ,  an  explicit  solution  for 
w^(t)  can  be  obtained  if  the  noise  process  is  wide  sense  stationary.  The 
solution  for  this  case  was  found  by  Tufts  [2]  using  a  very  different  approach  . 
Tufts  derived  the  mean  square  error  optimum  linear  receiver  \mder  the 
assumption  that  the  only  statistical  knowledge  available  to  the  receiver 
designer  was  the  message  vector  autocorrelation  matrix  ft  and  the  noise 
autocorrelation  function  y(t,  a)  =  ;  signal  and  noise  were  assumed 

additive  and  independent.  Calculus  of  variations  yielded  an  integral  equation 
the  solution  of  which  was  the  transfer  function  W^(  f )  of  the  optimum 
receiver  filter,  namely 
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j2irf(T  -T) 


OO  GO 

where  W.(f)  =  ^w.  ( t )  e’^^^  dt ,  R(f)  =  J  r  ( t )  e"-’^’^  dt  , 


and  ^f(f)  =  ^  4^(t)  e  <Jt  .  Equation  {35)  represents  a  "matched  filter" 

A  -GO 

( f ) 

’  f  independent  of  i  followed  by  a  delay  line  with  N  taps  spaced  like  the 
transmission  instants  Tj^  and  weighted  by  gain  coefficients  which  vary  with  i  . 
The  optimum  estimate  g^^  is  the  sum  of  the  weighted  tap  outputs  of  time  T 
(see  Fig.  2). 


Fig.2 •Optimum  Receiver  for  Stationary  Noise  and  an  Infinite  Observation  Interval 
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In  Appendix  III,  Eq.  (30)  is  solved  for  -►  -oo  ,  +  oo  and 

stationary  noise,  and  the  solution  is  shown  to  be  equivalent  to  Tufts'  solution 
as  given  by  (35)  and  Fig.  2.  The  present  analysis  shows  that  the  resultant 
receiver  is  not  only  the  best  linear  one,  but  also  the  best  nonlinear  receiver 
in  the  minimax  sense.  For  either  of  these  interpretations  of  the  receiver,  the 
joint  distribution  p[a,  y  (t)  ]  of  signal  and  noise  has  been  assumed  to  belong 
to  the  continuous  analog  of  V  ,  i.  e. ,  it  is  known  that  for  all  j,  k,  =1,  2,  . .  . ,  N 


and  for  all  t^  t^  e  (-co,  oo) 

E  (a.  a,  )  =  i., 
p  '  j  k'  ^jk 

(36a) 

Ep[a.y(tj)]  =  0 

(36b) 

Ep  [y(tj  )  y(t2)]  =7  (tj  ,  t^) 

(36c) 

Conclusion: 

It  has  been  shown  that  in  mean  square  error  estimation  problems 
with  additive,  uncorrelated  noise  and  known  second-order  statistics,  the  Bayes 
estimation  rule  for  the  specific  case  of  statistically  independent,  zero-mean 
Gaussian  signal,  and  noise  statistics  is  also  both  the  best  linear  estimation 
rule  and  the  best  minimax  estimation  rule.  This  fact  has  been  used  to 
design  optimum  sampled-data  and  continuous  time  receivers  for  a  practical 
PAM  communication  system.  In  the  continuous  time  case  with  an  infinite 
observation  interval  and  stationary  noise  statistics,  the  receiver  thus  designed 
agrees  with  the  optimum  linear  receiver  previously  derived  for  this  case 
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by  Tufts  [2]  using  a  very  different  approach;  the  present  analysis  shows  that 
Tufts's  linear  receiver  is,  therefore,  also  the  best  nonlinear  minimax 
receiver. 

The  above  theory  could  be  employed  as  the  mathematical  basis  for  an 
investigation  of  the  extent  of  degradation  in  system  performance  suffered  by 
adopting  a  sampled  data  as  opposed  to  a  continuous  time  receiver.  It  is 
also  hoped  that  the  above  constitutes  a  useful  contribution  to  the  complex  field 
of  the  interrelations  between  different  system  performance  criteria. 
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APPENDIX  I 


v2  2 


1  r(x-o)  j.a  -I  „  1  r®  “2ax  _l  o  ^ 

«  -zi— r-+rz-i  „  ??  -zi— 2 — +-?! 


QO  00  -^[*2 — -1  00 

^ap2(x-a)pj(a)da  ^ae 

CD 


b“_  _2d  (  “  (r“  b‘ 


*\  /  »  -oo 

“iW=ar 


-OO 


da 


Jp2(x-o)pj(a)do  ®  ^  ~I  [- — 9*^  +  ^  ] 


-OO 


.f 


u*-  r  ^  ^ 

<r  b  j  {  O'  b  j 

e  da  \  e  da 


-oo 


-00 


^  i  \  2  d  - 

ai(x)  =  O'  gjffn 


2  " 

1  ra  -6ax  .a  i 

®  -jI-T - 

e  da 


OG 

I 


■00 


0r. 


2 
X 

TTT. 


2(b‘*+<r‘') 


^  \  2d 

ai(x) =  a 


2 
X 

TTZ 


L  Zv  2  (b‘'+  0-“)  J  b**  +  «r‘ 


=  >-rV)  - 


““I 


-00-00 


-OO  -00 


4(t^  ^  4' 

e  “  do  dy 


2  .  2 


2  2 


TR438 


-20- 


APPENDIX  II 

In  what  follows,  omitted  exponents  in  the  denominator  equal  those  in 
the  numerator. 


Let  C  be  the  MxN  rectangular  matrix  whose  n-k^^  element  is  ,  and 

consider  the  matrix^  +C  F  C  =  _A  .  Since  both  9  and  F  are  symmetric 
and  strictly  positive  definite,  one  easily  verifies  that  A  is,  also.  We  perform 
a  transformation  to  the  principal  axes  of  the  quadratic  form  in  the  exponent 
by  defining  the  new  coordinates  ^  according  to  the  matrix  relation  a  =  A  g  , 
where  A  =  (ajj^)  is  the  matrix  of  normalized  eigenvectors  of  A  .  Denoting 
the  eigenvalues  of  ^  by  V| ;  I  =  1 ,  2,  . . . ,  N,  we  obtain 


Defining 


CA 


we  can  express 


^  /  V 

a.  (x) 


in  the  form 


i=l 


N  /  "  "  dp, 


^  a  f 
-  \ 

~  L  V. 

1=1  ' 


M  M  N 


Since  X  Vn  ^  ^nk  »kf  »  this  becomes 


m=l  n=l  k=l 


A, 

a 


m=l 


M  M  N  N 

.  { X )  =  /  6.  X  ,  where  6.  =/  y  /  c,/ 

i  -  im  m  ’  im  ^  'mn  ^  nk 

n=l 


N  N 

i  'nki 

k=l  i=l 


\i  ^ki 
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APPENDIX  III 

-1  ® 

If  we  make  the  replacements  c  .  =  f  .  and  y  =  .  as 
^  mj  ^  mj  'mn  X.^ 

discussed  after  Eq.  (29),  the  expression  for  as  given  at  the  end 

of  Appendix  II  becomes 

M  e  N  N  N  N 

a  -  Y  nin  Y  .  T  ^ii^ki  _  J_  Y  -  Y 

im  ~  Zj  \  L,  ^nk  Zj  V  .  ~  \  Zj  ^mk  Zj  v. 

n=l  k=l  f=l  '  ""  k=l  f=l  ' 


Combining  this  with  Eq.  (30)  and  the  definition  of  from  Eq.  (28),  we 

obtain 

T. 


tj  k=l  i=l  *  k=l  Tj  1=1  * 


dt  =  0  . 


This  may  be  rearrange  in  the  form 

k=li=l  ' 

From  the  completeness  of  the  (t)  on  (Tj,  T^)  we  may  conclude  that 
N  N 


T, 


1 


1  1  * 


<T2. 


k=ll=l 

Writing  7(t,  s)  =  because  the  noise  is  stationary,  and  letting 

Tj-»  -  CO  and  oo,  we  may  write  the  integral  equation  (31)  in  the  form 
N  N  ^  oo 

X  Y,  "'V'~  "I  (^(t-s)Zj(s)ds  ;  t€(-oo,  00). 

k=l  i=l  ^  -00 
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Fourier  transforming  then  yields 


N  N 

I  I 


k=l  1=1 


U  R  ( f )  e  * 

''i 


^(f)  Z.(f) 


^  -jZfffT  ^ 

Since  W^(£)  =  (£)  e  £rom  (33),  and  since  ^(£)  =  (£),  the 

solution  £or  the  trans£er  function  o£  the  filter  to  be  sampled  at  t  =  T  in 

order  to  obtain  is 


Accordingly,  the  optimum  receiver  is  a  matched  filter  followed  by  a  tapped 
delay  line,  agreeing  with  Tufts's  result  as  given  in  £q.  (35).  The  tap  gain 


coefficients  are  given  by 


1-1 


and 


it  remains  to  establish  that  these 


are  equivalent  to  the  d(i,k)  given  by  Tufts.  Toward  this  end,  let  us  recall 

from  Appendix  II  that  A  is  the  matrix  of  normalized  eigenvectors  of  A , 

-1  T 

so  that  we  have  A  =  ^  >  S'Hd  we  may  write 


<1 

II 

r*  M 

i  ’•  i 

T  -1 

A  ;  A  ^  =  A 

r-u,  ) 

\o  'v^/ 
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Accordingly,  the 


i-k 


th 


element  of 


^ii  ^ki 


which  shows  that 


the  tap  gains  for  the  optimvim  receiver  prescribed  by  the  present  theory  are 
the  elements  of  the  inverse  of  A  =  9  ^  other  hand, 

temporarily  adopting  Tufts's  notation,  we  obtain  by  combining  his  Eqs.  (7) 
and  (15)  the  following  representation  for  the  matrix  D  of  his  tap  gain 
coefficients  d  ( i,  k ) 

D  =  M-C=  M-MQ  (M"^+Q)‘^  =  M- [M  -  (M"^+Q)"^]  = 

Accordingly, Tufts 's  tap  gains  are  the  elements  of  the  inverse  of  M*^+Q  . 

Since  his  M  is  our  9  ,  it  only  remains  to  show  that  his  matrix  Q  is  the 

T  - 1  th 

same  as  our  matrix  C  F  C.  Now  the  i-k  element  of  Q  is  defined, 
in  our  notation,  as 


•00 


T  1 

We  now  study  the  behavior  of  C  F”  C  in  the  limit  T,-^-oo,  T.  -►  oo,  and 

“  “  ~  _1  fi  n  ^ 

M-^oo.  First,  we  have  c  .  =  ?  and  y  ~  =  — ? —  because  we  are 

’  mj  ^mj’ 


'mn 


m 


th 


using  Karhiinen-Lioeve  expansion  coefficients,  so  that  the  i-k  element  of 
c'^F'^C  is 


M  M 

II' 

m=l  n=l 


‘mi 


mn 


m 


‘nk 


M 

=  I 

m=l 


^mi^mk 


m 
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It  is  well  known  ([3],  p.  93  or  [4],  p.  167)  that  the  Fourier  coefficients  of  a 

stationary,  Gaussian  process  become  statistically  independent  in  the  limit  of 

an  infinitely  long  expansion  interval.  Accordingly,  in  this  limit  we  may  use 

as  our  orthonormal  functions  f  ( t )  the  trigonometric  functions 

ni 


jZffmt 


t  -  T" 
^2  1 


f  ( t )  = 


m 


(VTj) 


Xfl  i  m  =  0,  +  1,  +  2,  ,  .  . 


Because  these  functions  are  complex^  some  minor  complications  arise. 

5mi 

In  particular^  defining  ^  ^  ,  we  had  in  the  real  case  that 

m 

T-IT  T-14’ 

C  r"  C  =  B  B  ;  in  the  complex  case,  this  becomes  C  F  C  =  B*B  , 

where  the  indicates  conjugate  transpose.  Moreover,  summations 

over  the  index  m  now  r\m  from  -M  to  M  rather  than  from  1  to  M.  Our 

th  T  -1 

expression  for  the  i-k  element  of  C  F  C  , therefore,  becomes 


Substituting  the  trigonometric  functions,  this  assumes  the  form 
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We  now  take  our  limit  in  such  a  way  that  as  (T^-T^)  and  m  approach 

infinity,  their  ratio  f,  representing  frequency  in  cps,  remains  finite.  In 

this  limit,  becomes  the  frequency  differential  df  and  X.  becomes 

T2-T1  m 

([4])  P-  168).  The  above  sum,  therefore,  tends  to  the  integral 


R(f)R  (f) 
“TTTJ - 


j2?rf(Tj-Tj^) 

e 


=  <lik  • 


This  completes  the  demonstration  of  the  fact  that  the  present  theory  is  in 
accord  with  Tufts's  earlier  work. 
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